Overview

Dataset statistics

Number of variables: 17
Number of observations: 21045
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.7 MiB
Average record size in memory: 136.0 B
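
The two memory figures above agree with each other under a simple assumption. The sketch below assumes (the report does not state this) that every value occupies 8 bytes per row, for example a 64-bit number or an object pointer, and it ignores index overhead. `BYTES_PER_VALUE` is that assumption, not a value taken from the report.

```python
# Assumption (not stated in the report): every value takes 8 bytes per row.
BYTES_PER_VALUE = 8

n_variables = 17        # "Number of variables" from the report
n_observations = 21045  # "Number of observations" from the report

# Size of one record, then of the whole table, in bytes and MiB.
record_size_bytes = n_variables * BYTES_PER_VALUE
total_size_mib = record_size_bytes * n_observations / 2**20

print(f"record size: {record_size_bytes} B")
print(f"total size:  {total_size_mib:.1f} MiB")
```

Under that assumption the record size comes out at 136 B and the total at about 2.7 MiB, in line with the figures reported above.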

Variable types

Categorical: 2
Numeric: 15

Alerts

Date is highly correlated with YRMODAHRMI and 2 other fields (High correlation)
Time is highly correlated with Hour (High correlation)
Altitude is highly correlated with Location and 3 other fields (High correlation)
YRMODAHRMI is highly correlated with Date and 2 other fields (High correlation)
Hour is highly correlated with Time (High correlation)
Humidity is highly correlated with Location and 3 other fields (High correlation)
AmbientTemp is highly correlated with Month and 4 other fields (High correlation)
PolyPwr is highly correlated with Humidity and 1 other field (High correlation)
Pressure is highly correlated with Location and 5 other fields (High correlation)
Month is highly correlated with Date and 3 other fields (High correlation)
Location is highly correlated with Latitude and 5 other fields (High correlation)
Latitude is highly correlated with Location and 3 other fields (High correlation)
Longitude is highly correlated with Location and 3 other fields (High correlation)
Season is highly correlated with Date and 3 other fields (High correlation)
Cloud.Ceiling is highly correlated with Location (High correlation)
Wind.Speed has 1787 (8.5%) zeros (Zeros)

Reproduction

Analysis started: 2022-10-18 15:25:17.856507
Analysis finished: 2022-10-18 15:26:11.696929
Duration: 53.84 seconds
Software version: pandas-profiling v3.3.0
Download configuration: config.json

Variables

Location
Categorical

HIGH CORRELATION

Distinct: 12
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 164.5 KiB

Value               Count
Travis              2746
Peterson            2640
USAFA               2573
Hill Weber          2384
March AFB           2204
Other values (7)    8498

Length

Max length: 11
Median length: 9
Mean length: 7.285863626
Min length: 4

Characters and Unicode

Total characters: 153331
Distinct characters: 36
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
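
As a minimal sketch of how such character properties can be counted, the snippet below uses Python's standard `unicodedata` module to tally the general category of every character in a few Location values taken from this report. The helper name `category_counts` and the three-value sample are illustrative choices; this is not the profiler's own code.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over all characters in `values`."""
    counts = Counter()
    for value in values:
        for ch in value:
            # e.g. "Lu" = Uppercase Letter, "Ll" = Lowercase Letter,
            # "Zs" = Space Separator
            counts[unicodedata.category(ch)] += 1
    return counts


# Illustrative input: three of the Location values listed in this report.
sample = ["Camp Murray", "Travis", "March AFB"]
counts = category_counts(sample)
print(dict(counts))
```

For these three names only the categories `Lu`, `Ll` and `Zs` occur, which matches the three distinct categories reported for the full column.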

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Camp Murray
2nd row: Camp Murray
3rd row: Camp Murray
4th row: Camp Murray
5th row: Camp Murray

Common Values

Value               Count    Frequency (%)
Travis              2746     13.0%
Peterson            2640     12.5%
USAFA               2573     12.2%
Hill Weber          2384     11.3%
March AFB           2204     10.5%
JDMT                1779     8.5%
Malmstrom           1517     7.2%
Grissom             1487     7.1%
Camp Murray         1113     5.3%
Kahului             941      4.5%
Other values (2)    1661     7.9%

Length

Histogram of lengths of the category
Value               Count    Frequency (%)
travis              2746     10.3%
peterson            2640     9.9%
usafa               2573     9.6%
hill                2384     8.9%
weber               2384     8.9%
march               2204     8.2%
afb                 2204     8.2%
jdmt                1779     6.7%
malmstrom           1517     5.7%
grissom             1487     5.6%
Other values (5)    4828     18.1%

Most occurring characters

Value               Count    Frequency (%)
r                   15204    9.9%
e                   10048    6.6%
s                   9877     6.4%
a                   9634     6.3%
A                   8130     5.3%
i                   7558     4.9%
M                   7393     4.8%
l                   7226     4.7%
t                   5919     3.9%
(space)             5701     3.7%
Other values (26)   66641    43.5%

Most occurring categories

Value               Count    Frequency (%)
Lowercase Letter    97727    63.7%
Uppercase Letter    49903    32.5%
Space Separator     5701     3.7%

Most frequent character per category

Lowercase Letter
Value               Count    Frequency (%)
r                   15204    15.6%
e                   10048    10.3%
s                   9877     10.1%
a                   9634     9.9%
i                   7558     7.7%
l                   7226     7.4%
t                   5919     6.1%
o                   5644     5.8%
m                   5634     5.8%
u                   3876     4.0%
Other values (8)    17107    17.5%

Uppercase Letter
Value               Count    Frequency (%)
A                   8130     16.3%
M                   7393     14.8%
F                   4777     9.6%
T                   4525     9.1%
P                   2640     5.3%
S                   2573     5.2%
U                   2573     5.2%
W                   2384     4.8%
H                   2384     4.8%
G                   2267     4.5%
Other values (7)    10257    20.6%

Space Separator
Value               Count    Frequency (%)
(space)             5701     100.0%

Most occurring scripts

Value               Count    Frequency (%)
Latin               147630   96.3%
Common              5701     3.7%

Most frequent character per script

Latin
Value               Count    Frequency (%)
r                   15204    10.3%
e                   10048    6.8%
s                   9877     6.7%
a                   9634     6.5%
A                   8130     5.5%
i                   7558     5.1%
M                   7393     5.0%
l                   7226     4.9%
t                   5919     4.0%
o                   5644     3.8%
Other values (25)   60997    41.3%

Common
Value               Count    Frequency (%)
(space)             5701     100.0%

Most occurring blocks

Value               Count    Frequency (%)
ASCII               153331   100.0%

Most frequent character per block

ASCII
Value               Count    Frequency (%)
r                   15204    9.9%
e                   10048    6.6%
s                   9877     6.4%
a                   9634     6.3%
A                   8130     5.3%
i                   7558     4.9%
M                   7393     4.8%
l                   7226     4.7%
t                   5919     3.9%
(space)             5701     3.7%
Other values (26)   66641    43.5%

Date
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 500
Distinct (%): 2.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20177196.32
Minimum: 20170523
Maximum: 20181004
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 164.5 KiB

Quantile statistics

Minimum: 20170523
5-th percentile: 20170709
Q1: 20171110
Median: 20180317
Q3: 20180623
95-th percentile: 20180903
Maximum: 20181004
Range: 10481
Interquartile range (IQR): 9513

Descriptive statistics

Standard deviation: 4579.585358
Coefficient of variation (CV): 0.0002269683699
Kurtosis: -1.584172181
Mean: 20177196.32
Median Absolute Deviation (MAD): 411
Skewness: -0.6362495955
Sum: 4.246290965 × 10^11
Variance: 20972602.05
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value               Count    Frequency (%)
20180406            83       0.4%
20180519            75       0.4%
20180722            75       0.4%
20180609            74       0.4%
20180502            72       0.3%
20180721            72       0.3%
20180602            71       0.3%
20180518            71       0.3%
20180811            69       0.3%
20180816            69       0.3%
Other values (490)  20314    96.5%

Minimum values

Value               Count    Frequency (%)
20170523            4        < 0.1%
20170524            5        < 0.1%
20170525            20       0.1%
20170526            13       0.1%
20170527            18       0.1%
20170528            13       0.1%
20170529            12       0.1%
20170530            14       0.1%
20170531            11       0.1%
20170601            9        < 0.1%

Maximum values

Value               Count    Frequency (%)
20181004            13       0.1%
20181003            11       0.1%
20181002            33       0.2%
20181001            22       0.1%
20180930            24       0.1%
20180929            32       0.2%
20180928            49       0.2%
20180927            47       0.2%
20180926            53       0.3%
20180925            52       0.2%

Time
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 24
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1267.483725
Minimum: 1000
Maximum: 1545
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 164.5 KiB

Quantile statistics

Minimum: 1000
5-th percentile: 1000
Q1: 1100
Median: 1300
Q3: 1400
95-th percentile: 1500
Maximum: 1545
Range: 545
Interquartile range (IQR): 300

Descriptive statistics

Standard deviation: 167.6027667
Coefficient of variation (CV): 0.1322326775
Kurtosis: -1.198996556
Mean: 1267.483725
Median Absolute Deviation (MAD): 115
Skewness: -0.09593808745
Sum: 26674195
Variance: 28090.6874
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=24)
Common values

Value               Count    Frequency (%)
1500                3223     15.3%
1300                3168     15.1%
1400                3073     14.6%
1200                3034     14.4%
1100                2743     13.0%
1000                2513     11.9%
1030                224      1.1%
1315                223      1.1%
1330                215      1.0%
1445                215      1.0%
Other values (14)   2414     11.5%

Minimum values

Value               Count    Frequency (%)
1000                2513     11.9%
1015                103      0.5%
1030                224      1.1%
1045                117      0.6%
1100                2743     13.0%
1115                140      0.7%
1130                199      0.9%
1145                169      0.8%
1200                3034     14.4%
1215                195      0.9%

Maximum values

Value               Count    Frequency (%)
1545                146      0.7%
1530                181      0.9%
1515                174      0.8%
1500                3223     15.3%
1445                215      1.0%
1430                207      1.0%
1415                201      1.0%
1400                3073     14.6%
1345                208      1.0%
1330                215      1.0%

Latitude
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 12
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 38.21382324
Minimum: 20.89
Maximum: 47.52
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 164.5 KiB

Quantile statistics

Minimum: 20.89
5-th percentile: 26.98
Q1: 38.16
Median: 38.95
Q3: 41.15
95-th percentile: 47.52
Maximum: 47.52
Range: 26.63
Interquartile range (IQR): 2.99

Descriptive statistics

Standard deviation: 6.323760959
Coefficient of variation (CV): 0.1654835979
Kurtosis: 0.9838233675
Mean: 38.21382324
Median Absolute Deviation (MAD): 2.2
Skewness: -0.9860827074
Sum: 804209.91
Variance: 39.98995266
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=12)
Common values

Value               Count    Frequency (%)
38.16               2746     13.0%
38.82               2640     12.5%
38.95               2573     12.2%
41.15               2384     11.3%
33.9                2204     10.5%
26.98               1779     8.5%
47.52               1517     7.2%
40.67               1487     7.1%
47.11               1113     5.3%
20.89               941      4.5%
Other values (2)    1661     7.9%

Minimum values

Value               Count    Frequency (%)
20.89               941      4.5%
26.98               1779     8.5%
33.9                2204     10.5%
38.16               2746     13.0%
38.82               2640     12.5%
38.95               2573     12.2%
40.67               1487     7.1%
41.13               881      4.2%
41.15               2384     11.3%
44.89               780      3.7%

Maximum values

Value               Count    Frequency (%)
47.52               1517     7.2%
47.11               1113     5.3%
44.89               780      3.7%
41.15               2384     11.3%
41.13               881      4.2%
40.67               1487     7.1%
38.95               2573     12.2%
38.82               2640     12.5%
38.16               2746     13.0%
33.9                2204     10.5%

Longitude
Real number (ℝ)

HIGH CORRELATION

Distinct: 12
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: -108.5936778
Minimum: -156.44
Maximum: -80.11
Zeros: 0
Zeros (%): 0.0%
Negative: 21045
Negative (%): 100.0%
Memory size: 164.5 KiB

Quantile statistics

Minimum: -156.44
5-th percentile: -122.57
Q1: -117.26
Median: -111.18
Q3: -104.71
95-th percentile: -80.11
Maximum: -80.11
Range: 76.33
Interquartile range (IQR): 12.55

Descriptive statistics

Standard deviation: 16.3641299
Coefficient of variation (CV): -0.1506913683
Kurtosis: 1.428597136
Mean: -108.5936778
Median Absolute Deviation (MAD): 6.47
Skewness: -0.5501023125
Sum: -2285353.95
Variance: 267.7847474
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=12)
Common values

Value               Count    Frequency (%)
-121.56             2746     13.0%
-104.71             2640     12.5%
-104.83             2573     12.2%
-111.99             2384     11.3%
-117.26             2204     10.5%
-80.11              1779     8.5%
-111.18             1517     7.2%
-86.15              1487     7.1%
-122.57             1113     5.3%
-156.44             941      4.5%
Other values (2)    1661     7.9%

Minimum values

Value               Count    Frequency (%)
-156.44             941      4.5%
-122.57             1113     5.3%
-121.56             2746     13.0%
-117.26             2204     10.5%
-111.99             2384     11.3%
-111.18             1517     7.2%
-104.83             2573     12.2%
-104.71             2640     12.5%
-95.75              881      4.2%
-93.2               780      3.7%

Maximum values

Value               Count    Frequency (%)
-80.11              1779     8.5%
-86.15              1487     7.1%
-93.2               780      3.7%
-95.75              881      4.2%
-104.71             2640     12.5%
-104.83             2573     12.2%
-111.18             1517     7.2%
-111.99             2384     11.3%
-117.26             2204     10.5%
-121.56             2746     13.0%

Altitude
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 11
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 798.8436683
Minimum: 1
Maximum: 1947
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 164.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 2
Median: 458
Q3: 1370
95-th percentile: 1947
Maximum: 1947
Range: 1946
Interquartile range (IQR): 1368

Descriptive statistics

Standard deviation: 770.6817943
Coefficient of variation (CV): 0.9647467018
Kurtosis: -1.504816817
Mean: 798.8436683
Median Absolute Deviation (MAD): 457
Skewness: 0.4117204744
Sum: 16811665
Variance: 593950.428
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=11)
Common values

Value               Count    Frequency (%)
1                   2746     13.0%
2                   2720     12.9%
1879                2640     12.5%
1947                2573     12.2%
1370                2384     11.3%
458                 2204     10.5%
1043                1517     7.2%
239                 1487     7.1%
84                  1113     5.3%
380                 881      4.2%

Minimum values

Value               Count    Frequency (%)
1                   2746     13.0%
2                   2720     12.9%
84                  1113     5.3%
239                 1487     7.1%
246                 780      3.7%
380                 881      4.2%
458                 2204     10.5%
1043                1517     7.2%
1370                2384     11.3%
1879                2640     12.5%

Maximum values

Value               Count    Frequency (%)
1947                2573     12.2%
1879                2640     12.5%
1370                2384     11.3%
1043                1517     7.2%
458                 2204     10.5%
380                 881      4.2%
246                 780      3.7%
239                 1487     7.1%
84                  1113     5.3%
2                   2720     12.9%

YRMODAHRMI
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 18
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.01771807 × 10^11
Minimum: 2.01705 × 10^11
Maximum: 2.0181 × 10^11
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 164.5 KiB

Quantile statistics

Minimum: 2.01705 × 10^11
5-th percentile: 2.01707 × 10^11
Q1: 2.01711 × 10^11
Median: 2.01803 × 10^11
Q3: 2.01806 × 10^11
95-th percentile: 2.01809 × 10^11
Maximum: 2.0181 × 10^11
Range: 105000000
Interquartile range (IQR): 95000000

Descriptive statistics

Standard deviation: 45798457.77
Coefficient of variation (CV): 0.0002269814522
Kurtosis: -1.58412901
Mean: 2.01771807 × 10^11
Median Absolute Deviation (MAD): 4000000
Skewness: -0.6362630047
Sum: 4.246287679 × 10^15
Variance: 2.097498734 × 10^15
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=18)
Common values

Value               Count    Frequency (%)
2.01807 × 10^11     1818     8.6%
2.01805 × 10^11     1815     8.6%
2.01808 × 10^11     1805     8.6%
2.01806 × 10^11     1707     8.1%
2.01804 × 10^11     1540     7.3%
2.01801 × 10^11     1382     6.6%
2.01803 × 10^11     1343     6.4%
2.01711 × 10^11     1260     6.0%
2.01809 × 10^11     1138     5.4%
2.01712 × 10^11     1137     5.4%
Other values (8)    6100     29.0%

Minimum values

Value               Count    Frequency (%)
2.01705 × 10^11     110      0.5%
2.01706 × 10^11     637      3.0%
2.01707 × 10^11     1111     5.3%
2.01708 × 10^11     1130     5.4%
2.01709 × 10^11     1106     5.3%
2.0171 × 10^11      824      3.9%
2.01711 × 10^11     1260     6.0%
2.01712 × 10^11     1137     5.4%
2.01801 × 10^11     1382     6.6%
2.01802 × 10^11     1103     5.2%

Maximum values

Value               Count    Frequency (%)
2.0181 × 10^11      79       0.4%
2.01809 × 10^11     1138     5.4%
2.01808 × 10^11     1805     8.6%
2.01807 × 10^11     1818     8.6%
2.01806 × 10^11     1707     8.1%
2.01805 × 10^11     1815     8.6%
2.01804 × 10^11     1540     7.3%
2.01803 × 10^11     1343     6.4%
2.01802 × 10^11     1103     5.2%
2.01801 × 10^11     1382     6.6%

Month
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 12
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.565882632
Minimum: 1
Maximum: 12
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 164.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 4
Median: 7
Q3: 9
95-th percentile: 12
Maximum: 12
Range: 11
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 2.983958121
Coefficient of variation (CV): 0.4544641274
Kurtosis: -0.7240518019
Mean: 6.565882632
Median Absolute Deviation (MAD): 2
Skewness: -0.121414139
Sum: 138179
Variance: 8.90400607
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=12)
Common values

Value               Count    Frequency (%)
8                   2935     13.9%
7                   2929     13.9%
6                   2344     11.1%
9                   2244     10.7%
5                   1925     9.1%
4                   1540     7.3%
1                   1382     6.6%
3                   1343     6.4%
11                  1260     6.0%
12                  1137     5.4%
Other values (2)    2006     9.5%

Minimum values

Value               Count    Frequency (%)
1                   1382     6.6%
2                   1103     5.2%
3                   1343     6.4%
4                   1540     7.3%
5                   1925     9.1%
6                   2344     11.1%
7                   2929     13.9%
8                   2935     13.9%
9                   2244     10.7%
10                  903      4.3%

Maximum values

Value               Count    Frequency (%)
12                  1137     5.4%
11                  1260     6.0%
10                  903      4.3%
9                   2244     10.7%
8                   2935     13.9%
7                   2929     13.9%
6                   2344     11.1%
5                   1925     9.1%
4                   1540     7.3%
3                   1343     6.4%

Hour
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 6
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12.62784509
Minimum: 10
Maximum: 15
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 164.5 KiB

Quantile statistics

Minimum: 10
5-th percentile: 10
Q1: 11
Median: 13
Q3: 14
95-th percentile: 15
Maximum: 15
Range: 5
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 1.67295201
Coefficient of variation (CV): 0.1324811951
Kurtosis: -1.207316578
Mean: 12.62784509
Median Absolute Deviation (MAD): 1
Skewness: -0.09257268679
Sum: 265753
Variance: 2.798768427
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)
Common values

Value               Count    Frequency (%)
13                  3814     18.1%
15                  3724     17.7%
14                  3696     17.6%
12                  3603     17.1%
11                  3251     15.4%
10                  2957     14.1%

Minimum values

Value               Count    Frequency (%)
10                  2957     14.1%
11                  3251     15.4%
12                  3603     17.1%
13                  3814     18.1%
14                  3696     17.6%
15                  3724     17.7%

Maximum values

Value               Count    Frequency (%)
15                  3724     17.7%
14                  3696     17.6%
13                  3814     18.1%
12                  3603     17.1%
11                  3251     15.4%
10                  2957     14.1%
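
Because Hour takes only six distinct values, its summary statistics can be recomputed from the frequency table alone. The sketch below does so in plain Python. It assumes the reported variance is the sample variance (divisor n - 1), which the report does not state; the variable names are illustrative.

```python
from math import sqrt

# Value -> count pairs copied from the Hour frequency table in this report.
hour_counts = {10: 2957, 11: 3251, 12: 3603, 13: 3814, 14: 3696, 15: 3724}

n = sum(hour_counts.values())                        # number of observations
total = sum(v * c for v, c in hour_counts.items())   # "Sum"
mean = total / n                                     # "Mean"

# Assumption: sample variance with divisor n - 1.
variance = sum(c * (v - mean) ** 2 for v, c in hour_counts.items()) / (n - 1)
std = sqrt(variance)                                 # "Standard deviation"
cv = std / mean                                      # "Coefficient of variation (CV)"

print(n, total, mean, variance, std, cv)
```

Under that assumption the recomputed count, sum, mean, variance and CV agree with the Hour statistics listed above to the precision shown.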

Season
Categorical

HIGH CORRELATION

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 164.5 KiB

Value               Count
Summer              8208
Spring              4808
Fall                4407
Winter              3622

Length

Max length: 6
Median length: 6
Mean length: 5.581183179
Min length: 4

Characters and Unicode

Total characters: 117456
Distinct characters: 14
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Winter
2nd row: Winter
3rd row: Winter
4th row: Winter
5th row: Winter

Common Values

Value               Count    Frequency (%)
Summer              8208     39.0%
Spring              4808     22.8%
Fall                4407     20.9%
Winter              3622     17.2%

Length

Histogram of lengths of the category

Category Frequency Plot

Value               Count    Frequency (%)
summer              8208     39.0%
spring              4808     22.8%
fall                4407     20.9%
winter              3622     17.2%

Most occurring characters

Value               Count    Frequency (%)
r                   16638    14.2%
m                   16416    14.0%
S                   13016    11.1%
e                   11830    10.1%
l                   8814     7.5%
i                   8430     7.2%
n                   8430     7.2%
u                   8208     7.0%
p                   4808     4.1%
g                   4808     4.1%
Other values (4)    16058    13.7%

Most occurring categories

Value               Count    Frequency (%)
Lowercase Letter    96411    82.1%
Uppercase Letter    21045    17.9%

Most frequent character per category

Lowercase Letter
Value               Count    Frequency (%)
r                   16638    17.3%
m                   16416    17.0%
e                   11830    12.3%
l                   8814     9.1%
i                   8430     8.7%
n                   8430     8.7%
u                   8208     8.5%
p                   4808     5.0%
g                   4808     5.0%
a                   4407     4.6%

Uppercase Letter
Value               Count    Frequency (%)
S                   13016    61.8%
F                   4407     20.9%
W                   3622     17.2%

Most occurring scripts

Value               Count    Frequency (%)
Latin               117456   100.0%

Most frequent character per script

Latin
Value               Count    Frequency (%)
r                   16638    14.2%
m                   16416    14.0%
S                   13016    11.1%
e                   11830    10.1%
l                   8814     7.5%
i                   8430     7.2%
n                   8430     7.2%
u                   8208     7.0%
p                   4808     4.1%
g                   4808     4.1%
Other values (4)    16058    13.7%

Most occurring blocks

Value               Count    Frequency (%)
ASCII               117456   100.0%

Most frequent character per block

ASCII
Value               Count    Frequency (%)
r                   16638    14.2%
m                   16416    14.0%
S                   13016    11.1%
e                   11830    10.1%
l                   8814     7.5%
i                   8430     7.2%
n                   8430     7.2%
u                   8208     7.0%
p                   4808     4.1%
g                   4808     4.1%
Other values (4)    16058    13.7%

Humidity
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 10356
Distinct (%): 49.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 37.12194073
Minimum: 0
Maximum: 99.98779
Zeros: 52
Zeros (%): 0.2%
Negative: 0
Negative (%): 0.0%
Memory size: 164.5 KiB

Quantile statistics

Minimum: 0
5-th percentile: 6.26465
Q1: 17.5293
Median: 33.12378
Q3: 52.59399
95-th percentile: 84.254148
Maximum: 99.98779
Range: 99.98779
Interquartile range (IQR): 35.06469

Descriptive statistics

Standard deviation: 23.82301129
Coefficient of variation (CV): 0.641750157
Kurtosis: -0.2626667736
Mean: 37.12194073
Median Absolute Deviation (MAD): 17.08374
Skewness: 0.6652577812
Sum: 781231.2427
Variance: 567.535867
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                 Count    Frequency (%)
99.98779              200      1.0%
0                     52       0.2%
11.03516              11       0.1%
9.29565               11       0.1%
17.54761              9        < 0.1%
29.20532              8        < 0.1%
21.30737              8        < 0.1%
16.79688              8        < 0.1%
37.77466              8        < 0.1%
15.41748              8        < 0.1%
Other values (10346)  20722    98.5%

Minimum values

Value                 Count    Frequency (%)
0                     52       0.2%
0.0061                1        < 0.1%
0.02441               1        < 0.1%
0.03052               1        < 0.1%
0.04883               2        < 0.1%
0.07324               1        < 0.1%
0.16479               1        < 0.1%
0.1709                1        < 0.1%
0.19531               1        < 0.1%
0.23193               1        < 0.1%

Maximum values

Value                 Count    Frequency (%)
99.98779              200      1.0%
99.95728              1        < 0.1%
99.93896              1        < 0.1%
99.93286              1        < 0.1%
99.92676              1        < 0.1%
99.89014              1        < 0.1%
99.85962              1        < 0.1%
99.84131              1        < 0.1%
99.79858              1        < 0.1%
99.73755              3        < 0.1%

AmbientTemp
Real number (ℝ)

HIGH CORRELATION

Distinct: 5259
Distinct (%): 25.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 29.28511702
Minimum: -19.98177
Maximum: 65.73837
Zeros: 0
Zeros (%): 0.0%
Negative: 264
Negative (%): 1.3%
Memory size: 164.5 KiB

Quantile statistics

Minimum: -19.98177
5-th percentile: 7.0961
Q1: 21.91528
Median: 30.28915
Q3: 37.47467
95-th percentile: 49.00322
Maximum: 65.73837
Range: 85.72014
Interquartile range (IQR): 15.55939

Descriptive statistics

Standard deviation: 12.36682047
Coefficient of variation (CV): 0.4222902871
Kurtosis: 0.1613385729
Mean: 29.28511702
Median Absolute Deviation (MAD): 7.75703
Skewness: -0.3264691886
Sum: 616305.2876
Variance: 152.9382486
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                 Count    Frequency (%)
30.31181              19       0.1%
39.30756              19       0.1%
28.47137              18       0.1%
26.47987              17       0.1%
37.44698              17       0.1%
37.47719              17       0.1%
36.20827              17       0.1%
31.03943              17       0.1%
34.9041               17       0.1%
29.05548              17       0.1%
Other values (5249)   20870    99.2%

Minimum values

Value                 Count    Frequency (%)
-19.98177             1        < 0.1%
-18.67256             1        < 0.1%
-18.67004             1        < 0.1%
-18.64487             1        < 0.1%
-18.58948             1        < 0.1%
-18.0658              1        < 0.1%
-17.99782             1        < 0.1%
-17.30797             1        < 0.1%
-16.75911             1        < 0.1%
-16.07681             1        < 0.1%

Maximum values

Value                 Count    Frequency (%)
65.73837              1        < 0.1%
64.50218              1        < 0.1%
64.43672              1        < 0.1%
63.90549              1        < 0.1%
63.86269              1        < 0.1%
63.21564              1        < 0.1%
62.58369              1        < 0.1%
62.48299              1        < 0.1%
61.987                1        < 0.1%
61.98196              1        < 0.1%

PolyPwr
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 18804
Distinct (%): 89.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12.97858273
Minimum: 0.25733
Maximum: 34.28502
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 164.5 KiB

Quantile statistics

Minimum: 0.25733
5-th percentile: 2.14695
Q1: 6.40457
Median: 13.7987
Q3: 18.86365
95-th percentile: 23.634938
Maximum: 34.28502
Range: 34.02769
Interquartile range (IQR): 12.45908

Descriptive statistics

Standard deviation: 7.12325543
Coefficient of variation (CV): 0.548846941
Kurtosis: -1.082213033
Mean: 12.97858273
Median Absolute Deviation (MAD): 5.94676
Skewness: -0.03534716445
Sum: 273134.2735
Variance: 50.74076792
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                 Count    Frequency (%)
3.16637               18       0.1%
2.77937               15       0.1%
2.88492               15       0.1%
0.80937               14       0.1%
4.3633                12       0.1%
3.20155               12       0.1%
3.62456               12       0.1%
2.95528               12       0.1%
2.74419               12       0.1%
3.02564               10       < 0.1%
Other values (18794)  20913    99.4%

Minimum values

Value                 Count    Frequency (%)
0.25733               1        < 0.1%
0.27026               1        < 0.1%
0.27234               1        < 0.1%
0.28152               2        < 0.1%
0.28512               1        < 0.1%
0.28701               1        < 0.1%
0.3013                1        < 0.1%
0.30784               1        < 0.1%
0.3099                1        < 0.1%
0.31405               1        < 0.1%

Maximum values

Value                 Count    Frequency (%)
34.28502              1        < 0.1%
34.26963              1        < 0.1%
33.58673              1        < 0.1%
33.57887              1        < 0.1%
33.24192              1        < 0.1%
33.14919              1        < 0.1%
32.08935              1        < 0.1%
32.07423              1        < 0.1%
31.98307              1        < 0.1%
31.76329              1        < 0.1%

Wind.Speed
Real number (ℝ≥0)

ZEROS

Distinct: 40
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10.31831789
Minimum: 0
Maximum: 49
Zeros: 1787
Zeros (%): 8.5%
Negative: 0
Negative (%): 0.0%
Memory size: 164.5 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 6
Median: 9
Q3: 14
95-th percentile: 22
Maximum: 49
Range: 49
Interquartile range (IQR): 8

Descriptive statistics

Standard deviation: 6.385029998
Coefficient of variation (CV): 0.6188053195
Kurtosis: 0.5282312904
Mean: 10.31831789
Median Absolute Deviation (MAD): 4
Skewness: 0.6270857455
Sum: 217149
Variance: 40.76860808
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=40)
Common values

Value               Count    Frequency (%)
0                   1787     8.5%
8                   1660     7.9%
7                   1633     7.8%
9                   1557     7.4%
6                   1544     7.3%
10                  1467     7.0%
5                   1408     6.7%
11                  1302     6.2%
3                   1204     5.7%
13                  1181     5.6%
Other values (30)   6302     29.9%

Minimum values

Value               Count    Frequency (%)
0                   1787     8.5%
1                   1        < 0.1%
2                   12       0.1%
3                   1204     5.7%
5                   1408     6.7%
6                   1544     7.3%
7                   1633     7.8%
8                   1660     7.9%
9                   1557     7.4%
10                  1467     7.0%

Maximum values

Value               Count    Frequency (%)
49                  2        < 0.1%
47                  1        < 0.1%
43                  1        < 0.1%
41                  1        < 0.1%
40                  4        < 0.1%
39                  2        < 0.1%
38                  3        < 0.1%
37                  5        < 0.1%
36                  4        < 0.1%
34                  11       0.1%

Visibility
Real number (ℝ≥0)

Distinct: 25
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9.700071276
Minimum: 0
Maximum: 10
Zeros: 68
Zeros (%): 0.3%
Negative: 0
Negative (%): 0.0%
Memory size: 164.5 KiB

Quantile statistics

Minimum: 0
5-th percentile: 8
Q1: 10
Median: 10
Q3: 10
95-th percentile: 10
Maximum: 10
Range: 10
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 1.351948651
Coefficient of variation (CV): 0.1393751254
Kurtosis: 27.27661223
Mean: 9.700071276
Median Absolute Deviation (MAD): 0
Skewness: -5.144762537
Sum: 204138
Variance: 1.827765154
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=25)
Common values

Value               Count    Frequency (%)
10                  19668    93.5%
9.1                 231      1.1%
7                   193      0.9%
8                   177      0.8%
6                   146      0.7%
5                   123      0.6%
4                   111      0.5%
3                   95       0.5%
0                   68       0.3%
2                   62       0.3%
Other values (15)   171      0.8%

Minimum values

Value               Count    Frequency (%)
0                   68       0.3%
0.1                 8        < 0.1%
0.3                 13       0.1%
0.4                 5        < 0.1%
0.5                 12       0.1%
0.6                 2        < 0.1%
0.8                 13       0.1%
0.9                 8        < 0.1%
1                   36       0.2%
1.3                 11       0.1%

Maximum values

Value               Count    Frequency (%)
10                  19668    93.5%
9.1                 231      1.1%
8.8                 2        < 0.1%
8                   177      0.8%
7                   193      0.9%
6.9                 1        < 0.1%
6.2                 5        < 0.1%
6                   146      0.7%
5                   123      0.6%
4                   111      0.5%

Pressure
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 934
Distinct (%): 4.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 925.944747
Minimum: 781.7
Maximum: 1029.5
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 164.5 KiB

Quantile statistics

Minimum: 781.7
5-th percentile: 799.5
Q1: 845.5
Median: 961.1
Q3: 1008.9
95-th percentile: 1019.2
Maximum: 1029.5
Range: 247.8
Interquartile range (IQR): 163.4

Descriptive statistics

Standard deviation: 85.21565875
Coefficient of variation (CV): 0.09203104077
Kurtosis: -1.558003544
Mean: 925.944747
Median Absolute Deviation (MAD): 56.8
Skewness: -0.3588775394
Sum: 19486507.2
Variance: 7261.708497
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value               Count    Frequency (%)
1010.9              130      0.6%
1009.6              128      0.6%
1010.6              123      0.6%
959.8               122      0.6%
1009.9              120      0.6%
959.2               117      0.6%
1009.2              113      0.5%
1008.9              110      0.5%
1018.2              107      0.5%
1011.6              107      0.5%
Other values (924)  19868    94.4%

Minimum values

Value               Count    Frequency (%)
781.7               1        < 0.1%
782.3               6        < 0.1%
782.5               1        < 0.1%
782.8               4        < 0.1%
783.1               1        < 0.1%
783.4               3        < 0.1%
783.6               3        < 0.1%
783.9               8        < 0.1%
784.2               3        < 0.1%
784.5               1        < 0.1%

Maximum values

Value               Count    Frequency (%)
1029.5              2        < 0.1%
1029.4              1        < 0.1%
1029.2              2        < 0.1%
1029.1              3        < 0.1%
1028.8              2        < 0.1%
1028.7              4        < 0.1%
1028.5              1        < 0.1%
1028.4              7        < 0.1%
1028.2              1        < 0.1%
1028.1              3        < 0.1%

Cloud.Ceiling
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 79
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 515.9667855
Minimum: 0
Maximum: 722
Zeros: 4
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 164.5 KiB

Quantile statistics

Minimum: 0
5-th percentile: 16
Q1: 140
Median: 722
Q3: 722
95-th percentile: 722
Maximum: 722
Range: 722
Interquartile range (IQR): 582

Descriptive statistics

Standard deviation: 301.9033793
Coefficient of variation (CV): 0.5851217323
Kurtosis: -1.252785313
Mean: 515.9667855
Median Absolute Deviation (MAD): 0
Skewness: -0.8224788989
Sum: 10858521
Variance: 91145.65044
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value               Count    Frequency (%)
722                 14256    67.7%
250                 588      2.8%
200                 226      1.1%
60                  215      1.0%
50                  192      0.9%
55                  186      0.9%
100                 168      0.8%
110                 150      0.7%
150                 137      0.7%
70                  132      0.6%
Other values (69)   4795     22.8%

Minimum values

Value               Count    Frequency (%)
0                   4        < 0.1%
1                   10       < 0.1%
2                   73       0.3%
3                   43       0.2%
4                   49       0.2%
5                   57       0.3%
6                   61       0.3%
7                   81       0.4%
8                   97       0.5%
9                   72       0.3%

Maximum values

Value               Count    Frequency (%)
722                 14256    67.7%
300                 35       0.2%
250                 588      2.8%
240                 73       0.3%
230                 53       0.3%
220                 42       0.2%
210                 42       0.2%
200                 226      1.1%
190                 61       0.3%
180                 83       0.4%

Interactions

Correlations

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.
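
The calculation just described can be sketched in plain Python: rank both variables (tied values share the mean of their positions), then divide the covariance of the ranks by the product of their standard deviations. The function names are illustrative and this is not the profiler's implementation.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values get the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def covariance_over_stds(x, y):
    """Covariance of x and y divided by the product of their std deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def spearman_rho(x, y):
    return covariance_over_stds(average_ranks(x), average_ranks(y))


x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]  # monotonic in x, but not linear
rho = spearman_rho(x, y)
linear = covariance_over_stds(x, y)  # same formula on the raw values
print(rho, linear)
```

Because `y` increases whenever `x` does, ρ reaches its maximum here, while the same formula applied to the raw values stays below 1.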

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, implying that for a linear relationship the steepness of the slope does not affect r; only its sign does.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
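
A minimal sketch of that formula follows, together with a check of the location and scale invariance mentioned above. The data values and the function name `pearson_r` are made up for illustration.

```python
from math import sqrt


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their std deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)


# Hypothetical, roughly linear data.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

r = pearson_r(x, y)
# Shift and rescale each variable separately (positive scale factors).
r_transformed = pearson_r([10 * a + 3 for a in x], [0.5 * b - 7 for b in y])
# Flipping the sign of one variable flips the sign of r.
r_flipped = pearson_r(x, [-b for b in y])
print(r, r_transformed, r_flipped)
```

The first two results coincide up to floating-point error, and the third has the same magnitude with the opposite sign.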

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
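
The pair-counting definition above can be written out directly. The sketch below implements exactly that form (often called tau-a), which makes no adjustment for ties; the library behind this report may use a tie-adjusted variant, so treat it as an illustration of the definition rather than of the reported numbers.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, without tie handling."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move the same way
        elif direction < 0:
            discordant += 1   # the variables move in opposite ways
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Hypothetical data: 7 concordant and 3 discordant pairs out of 10.
x = [1, 2, 3, 4, 5]
y = [3, 1, 2, 5, 4]
tau = kendall_tau_a(x, y)
print(tau)  # (7 - 3) / 10 = 0.4
```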

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
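
As a rough sketch of what a bias-corrected Cramér's V looks like, the function below applies the correction as it is commonly stated for Bergsma's proposal: subtract (k-1)(r-1)/(n-1) from φ² (clamped at zero) and shrink the row and column counts accordingly. It takes a contingency table as a list of rows and assumes every row and column total is non-zero. This is an illustration under those assumptions, not the profiler's own code.

```python
from math import sqrt


def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table (list of rows)."""
    n = sum(sum(row) for row in table)
    r, k = len(table), len(table[0])
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (table[i][j] - expected) ** 2 / expected

    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    k_corr = k - (k - 1) ** 2 / (n - 1)
    r_corr = r - (r - 1) ** 2 / (n - 1)
    return sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))


independent = [[10, 20], [20, 40]]  # rows are proportional: no association
perfect = [[50, 0], [0, 50]]        # each row maps to exactly one column
v_independent = cramers_v_corrected(independent)
v_perfect = cramers_v_corrected(perfect)
print(v_independent, v_perfect)
```

The two hypothetical tables hit the ends of the range: 0 for the proportional table and 1 for the diagonal one.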

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available in the Phik project documentation.

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
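
To make the two views concrete, here is a small plain-Python sketch that builds a nullity matrix and per-column counts. The rows are hypothetical and contain gaps on purpose; the profiled dataset itself has no missing cells, so its real displays are fully filled.

```python
# Hypothetical rows for illustration only; the profiled dataset has no
# missing cells. None marks a missing value.
rows = [
    {"Humidity": 81.7, "Wind.Speed": 5, "Visibility": 10.0},
    {"Humidity": None, "Wind.Speed": 0, "Visibility": 10.0},
    {"Humidity": 93.6, "Wind.Speed": None, "Visibility": 10.0},
]
columns = list(rows[0])

# Nullity matrix: one row per observation, True where a value is present.
nullity_matrix = [[row[col] is not None for col in columns] for row in rows]

# Nullity by column: number of present (non-missing) values per column.
present_per_column = {
    col: sum(flags[i] for flags in nullity_matrix)
    for i, col in enumerate(columns)
}

# Text rendering of the matrix: '#' = present, '.' = missing.
for flags in nullity_matrix:
    print("".join("#" if present else "." for present in flags))
print(present_per_column)
```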

Sample

First rows

       Location     Date      Time  Latitude  Longitude  Altitude  YRMODAHRMI    Month  Hour  Season  Humidity  AmbientTemp  PolyPwr   Wind.Speed  Visibility  Pressure  Cloud.Ceiling
0      Camp Murray  20171203  1145  47.11     -122.57    84        2.017120e+11  12     11    Winter  81.71997  12.86919     2.42769   5           10.0        1010.6    722
1      Camp Murray  20171203  1315  47.11     -122.57    84        2.017120e+11  12     13    Winter  96.64917  9.66415      2.46273   0           10.0        1011.3    23
2      Camp Murray  20171203  1330  47.11     -122.57    84        2.017120e+11  12     13    Winter  93.61572  15.44983     4.46836   5           10.0        1011.6    32
3      Camp Murray  20171204  1230  47.11     -122.57    84        2.017120e+11  12     12    Winter  77.21558  10.36659     1.65364   5           2.0         1024.4    6
4      Camp Murray  20171204  1415  47.11     -122.57    84        2.017120e+11  12     14    Winter  54.80347  16.85471     6.57939   3           3.0         1023.7    9
5      Camp Murray  20171204  1430  47.11     -122.57    84        2.017120e+11  12     14    Winter  47.10083  18.12363     2.92027   0           5.0         1023.7    722
6      Camp Murray  20171205  1115  47.11     -122.57    84        2.017120e+11  12     11    Winter  43.55469  19.41269     3.41284   0           4.0         1025.7    722
7      Camp Murray  20171205  1200  47.11     -122.57    84        2.017120e+11  12     12    Winter  30.56641  23.90930     4.82020   5           7.0         1026.0    722
8      Camp Murray  20171205  1300  47.11     -122.57    84        2.017120e+11  12     13    Winter  17.90771  32.32346     5.98127   5           10.0        1025.7    722
9      Camp Murray  20171205  1400  47.11     -122.57    84        2.017120e+11  12     14    Winter  14.40430  35.41267     4.96121   6           10.0        1025.4    722

Last rows

       Location     Date      Time  Latitude  Longitude  Altitude  YRMODAHRMI    Month  Hour  Season  Humidity  AmbientTemp  PolyPwr   Wind.Speed  Visibility  Pressure  Cloud.Ceiling
21035  USAFA        20180928  1330  38.95     -104.83    1947      2.018090e+11  9      13    Fall    15.83252  35.46806     16.15487  15          10.0        803.1     722
21036  USAFA        20180928  1400  38.95     -104.83    1947      2.018090e+11  9      14    Fall    13.26294  40.61424     15.75055  16          10.0        802.8     722
21037  USAFA        20180928  1445  38.95     -104.83    1947      2.018090e+11  9      14    Fall    11.03516  44.46884     14.00045  14          10.0        802.3     722
21038  USAFA        20180928  1500  38.95     -104.83    1947      2.018090e+11  9      15    Fall    11.73096  43.85201     12.91739  16          10.0        802.3     722
21039  USAFA        20180928  1515  38.95     -104.83    1947      2.018090e+11  9      15    Fall    11.46851  43.81424     11.52955  14          10.0        802.3     722
21040  USAFA        20180928  1530  38.95     -104.83    1947      2.018090e+11  9      15    Fall    11.66992  43.22510     9.79611   14          10.0        802.3     722
21041  USAFA        20180929  1300  38.95     -104.83    1947      2.018090e+11  9      13    Fall    18.22510  28.98247     10.88992  13          10.0        799.2     722
21042  USAFA        20180929  1400  38.95     -104.83    1947      2.018090e+11  9      14    Fall    15.52124  33.49167     8.24479   10          10.0        798.4     722
21043  USAFA        20180929  1500  38.95     -104.83    1947      2.018090e+11  9      15    Fall    6.63452   51.62163     12.47328  10          10.0        797.8     722
21044  USAFA        20181001  1400  38.95     -104.83    1947      2.018100e+11  10     14    Fall    22.58301  32.83958     6.39732   15          10.0        801.2     110